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Abstract 

We study a process where balls are repeatedly thrown into n boxes independently according 
to some probability distribution p. We start with n balls, and at each step all balls landing in 
the same box are fused into a single ball; the process terminates when there is only one ball 
left (coalescence). Let c := '^jPj, the collision probability of two fixed balls. We show that 
the expected coalescence time is asymptotically 2c~ ^ , under two constraints on p that exclude 
a thin set of distributions p. One of the constraints is c <C In"'^ n. This ln~^ n is shown to be a 
threshold value: for c S> ln~'^ n, there exists p with c(p) = c such that the expected coalescence 
time far exceeds . Connections to coalescent processes in population biology and theoretical 
computer science are discussed. 
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68W40, 92D25. 
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Remark. All limits in this paper are taken as n ^ oo, and we use Landau notation o, w, O in the 
usual sense. We say that an event A holds with high probability (whp) if P{A'^) = o(l), and we 
denote the set {1,2,..., n} by \n]. 

1 Introduction 

We consider the following balls-into-boxes process. Let p = (pi, . . . ,p„) be any probability vector. 
At time t = Q, start with 6o balls and throw them into n boxes, where each ball has probability 
Pj of landing in box j, independently of all other balls. Fuse all balls that land in the same box, 
and then repeat the allocation at times t ~ 1,2,... according to the same rules with this possibly 
smaller new number of balls. The random time T at which all balls are first fused into a single one 
is called the coalescence time; we will be mainly interested in its asymptotic expected value. 

This problem has been studied in various guises by numerous authors. Initially, it was stated in 
terms of finding the Most Recent Common Ancestor (MRCA) in a random genealogical process. We 
will restrict ourselves to the balls-into-boxes formulation here, but it is important to note that any of 
the results we obtain here can be recast in the language of population biology; we will occasionally 
provide the reader the appropriate analogy. The seminal work in this area was done by Kingman 
[71 [SI [S] who, for fixed &07 proved convergence of the underlying process to the continuous-time 
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coalescent process, thereby establishing convergence of the distribution of T/E[T]. In [7] he also 
proved that for bo ^ n and the uniform distribution p = . . . , 

E[T] < 2n- 2, 

which effectively implies that 

E[T]^2n, n^oo. (1.1) 

More recently, Donnelly and Tavare [3], Mohle [T2], and Mohle and Sagitov [13] have studied the 
limiting behavior of more general classes of allocation (reproduction) models for bo fixed. In [T3] . 
Mohle considered the case 6o — > oo as well for several models, which however do not include the one 
at hand. 

There is a good deal of literature on continuous-time coalescent processes that bears mentioning. 
For large times t, when the number of balls is small, we should expect relatively long time intervals 
during which no collisions happen, likely punctuated by binary (one-on-one) collisions. This behav- 
ior is characteristic of Kingman-type coalescent processes, and admits a natural time scaling to a 
continuous-time process. More recently, the theory of A- and 2-coalescents, developed by Pitman, 
Sagitov, and Schweinsberg [151 [191 |2Qj , among others, allows for models involving multiple simulta- 
neous collisions. Indeed, in our model, for small times t, we are likely to have multiple simultaneous 
collisions, however these happen at fixed time intervals. There is therefore no natural time-scaling 
that can be performed in order to interpret this behavior in the limit as a continuous-time process 
with random collision times. 

The process can also be described in terms of compositions of random functions: choose random 
functions {fs '■ [n] — *■ [nJjsgN independently, in such a way that for all i € [n], and for all s € N, 
fs{i) = j with probability pj, independently for all i and s. The coalescence time T is then the 
smallest value of t for which ft ° ■ ■ ■ ° f2 ° fi is a constant function. It is this formulation that 
has been used in connection with computer science: this problem is potentially useful in bounding 
the running time of so-called 'Coupling from the Past' (CFTP) algorithms introduced by Propp 
and Wilson [17l [T^; see Section [6] for a brief discussion of this. Motivated by this connection, 
and apparently unaware of Kingman's work for the uniform distribution, Dalai and Schmutz [2] 
established ([LT|); Fill [4] and Goh et al. [6] derived the hmiting distribution of T/E[T]. 

For bo = n, Adler et al. [H Theorem 4] were able to extend Kingman's result (|l.ip to a 
nonuniform p, showing that 

E[T]^2c^\ C2:^Y.Pv (1-2) 

3 

(note that C2 = 1/n for the uniform p), under the condition 

^<^, c,:=Y^p]. (1.3) 
C2 n ^-^ ■' 

Here C2, C3 are the probabilities of a double collision and a triple collision respectively. In essence, 
(|1.3p means that p is sufficiently close to (1/n, . . . , l/n). 

We should expect the largest contribution to the time T to happen during the late stages of the 
process, when the number of balls is relatively small. In this case, any reduction in the number of 
balls will most likely be due to the collision of a single pair of balls. This explains the appearance 
of C2 in (jl.2p . (In fact, C2 had been used as a scaling parameter for a wide class of models by Mohle 
in [12] in the context of population genetics.) The bound (|1.3|) on C3 ensures that, if in one of those 
late stages the number of balls has dropped, then the actual decrease is exactly 1 with conditional 
probability sufficiently close to 1. 

The proof in Ij revealed that the expected time spent in the late stages was about 2c^^ under 
conditions far less restrictive than (|1.3p . (II. 3p was used in |1J to show that the expected time spent 
in the early stages was o(c^^). 

In this paper we prove that this property of the process and p.2p continue to hold for a much 
wider class of distributions p. Here is our main result. 
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Theorem 1.1. Let bo ^ n. Suppose that for some £ > however small, 

C2 = o (ln"2 n) and C3 < n. (1.4) 

Then (i) 

S[T] = 2c2"^ (1 + 0(1)) for 00, (1.5) 

and (a) 

where Yfc are independent and exponentially distributed, P{Yk > x) = . 
Remarks. 

3/2 

1. It is always the case that C2 < 1 and C3 < ; so if not for the logarithmic factors, the 
conditions (|1.4p would not have excluded any probability vectors p. Further, 4^'^ /{n~^C2) > 
n^/^, since C2 > n^^ for all p. This means that the restriction C3/C2 < 3/n in [T] is much more 
stringent than C3 < Cj^^ In^'^-'^^'^-' n in our Theorem ll.il 

2. The lower bound in ([13]), E[T] > 2c-^{l - o{l)), holds for any p as long as C2 0. This can 
be deduced from [1] Theorem 2] via an elementary coupling argument; we will provide a brief 
proof in Section 2) Therefore, our main task is to prove a matching upper bound. 

3. For the case of the uniform p = (1/n, . . . , 1/n), ()1.6p follows from Kingman's work. Indepen- 
dently, it was proved later — in a setup close to that of our paper — by Fill |4j; still later Goh 
et al. [6] gave a detailed description of the cdf of the limiting distribution. 

4. Part (ii) of Theorem 1 1 . 1 1 follows from part (i), via an argument very similar to Fill's proof for 
the uniform case; cf. Theorem 6.1 of Mohle [T3] . 

Interestingly, ln~^ n appearing as the upper bound for C2 in Theorem 1 1.1 1 is a genuine threshold 
for the property "_B[T] is of order exactly": 

Theorem 1.2. Let bo = n. For 02 — lo (ln~^ n) , there exists a probability vector p with = C2 

such that with high probability, T — uj {c2^) : (md thus E^T] — uj {c^^^ ■ 

Adler et al. [U Theorem 5] had proved that E[T] 00 for a probability vector satisfying 
conditions ensuring that limc2 > 0. 

The rest of the paper is organized as follows. In Section [2l we bound the expected time spent 
during an early phase. We achieve this by showing that whp, for certain "small" values of t, the 
stochastic process is well-approximated by a deterministic process, amenable to sharp estimates. 
To that end we will build on the method used by Pittel [16] for asymptotic analysis of a rumor- 
spreading process introduced and studied by Frieze and Grimmett [S] . In Section [31 we bound the 
expected time spent in a "middle" phase and a "late" phase, showing that the late phase contributes, 
overwhelmingly, to the total number of steps. In section [4] we prove the lower bound for Theorem 
11.11 In Section[5]we prove Theorem ll.2l In the appendix we prove some auxiliary inequalities needed 
for the proof of Theorem 11.11 

2 The expected duration of an early phase 

Let the distribution p be given. Assume that &o — For t e N, we denote by B{t) the random 
number of balls at time t, so that -B(O) = n. In the language of the genealogical process, B{t) is the 
number of individuals at generation —t which have a descendant alive in the current generation 0. We 
denote by T(fc) the random first time t when B{t) falls below fc, i. e. r(fc) = min{i > : B{t) < k}. 
The coalescence time T is therefore r(l). Obviously, {B{t)}t>o is a Markov chain on the state space 
{1, 2, . . . , n}, so we will refer to B{t) as the state at time t. 
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Notice that, by the definition of the stochastic sequence {-S(t)}t>o, 

n 

E[B{t + l)\B{t) = k]=Y,[l-{l-pj)>'] , B{0)=n, 

j=i 

because the probability of box j receiving at least one ball out of k allocated balls is 1 — (1 — pj)''. 
It would seem natural to try to prove that the conditional distribution of B{t + 1) is concentrated 
around E[B{t+ l)\B{t) = fc], as long as k is large enough. Curiously, we will be able to show instead 
that for smallish t, whp B{t + 1) is relatively close to ^p{B{t)), where 

n 

$p(fc):=^(l-e-^'^'=). (2.1) 

Note that $p(S(t)) is close to E[B{t + l)\B{t)] when most of PjB{t) are small, which may not be 
the case when B{t) is relatively close to n. 

Here is an outline of our argument. We introduce A* = o{c2^), and a recurrence inequality 
which the random sequence B{t) is believed to satisfy whp as long as B{t) is above fc*. Assuming 
that the inequality does hold, we derive efhcient bounds for T(fc*). Lastly we show that indeed whp, 
{i?(t)}(<^(fe^) satisfies the recurrence inequality. 



2.1 Variational problems 

In order to determine the likely decline of {B{t)}, we need to boiuici $p(fc) from above for a certain 
range of k. This task seems quite hard, since ^p{k) depends on all n components of p in a rather 
complicated way. Remarkably, the worst case bounds will do the job quite efficiently, because the 
worst distribution p turns out to be much simpler than a feasible generic p. 

Let us define D„ to be the set of probability n- vectors. For any q e -D„, define 

n 

F^{k):=J2e-'"'', (2-2) 

so that 4'q(fc) = n — Fq{k), and set 

D{C2) := I q e £)„ X) «l = ^2 [ and D{c2, C3) := | q S D„ = C2, ^Qj ^csV 

That is, 0(02) (resp. £)(c2, C3)) is the set of probability vectors that share the same sum of squares 
(resp., squares and cubes) as p. When dealing with D{c2), 0(02,03), we will assume that 02 > 1/n; 
otherwise these sets are reduced to a point 1/n). The fimctional Fq{k) is continuous and 

the sets 0(02) and £)(c2,C3) are compact, so the infima of F (as a function of q, for fixed k) 
over these sets are attained. 



Proposition 2.1. For any k G [n], 



min Fq{k)=F^{k), 



where u is the uniform vector u = (1/n, . . . , 1/n). 
Proposition 2.2. For any k G [n], 

min Fq(A;) =Fe(c,)(fc), 

qe£'(c2) 

where 9 (02) = {9i,92, ■ ■ ■ ,0n) has the property that 61 > O2 = ■ ■ ■ = On (when listed in nonincreasing 
order). That is, 6 has only two distinct entries, and the larger one has support size equal to 1. We 
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will refer to such vectors as being of "topheavy" type. Using the equations 6i + {n — 1)^2 = 1 o,nd 
Of + {n — 1)02 = ^2, we can explicitly express the two entries of 6 as 



l + v/(n-l)(c2n-l) 

Z ' ^2 = -■ (^-i) 



Proposition 2.3. For any k £ [n], 

min i^q(fc) = i^r(c2,C3)(fc), 

qe-D(c2,c3) 

where r(c2,C3) := (ri,r2, . . . ,r„) (when ordered in nonincreasing order) has the following property: 
for some v £ [n], 

ri = ■ ■ ■ = > r^+i > r^+2 = ■■ ■ = r„. (2.4) 

That is, r(c2,C3) has at most 3 distinct entries, and the middle one (if any) has support size equal 
to 1. 

Clearly, D„ D D{c2) D £"(02,03), and so Fp(fc) > Fr(c2,c3)(fc) > ^e(c2)(fc) > ^u(fc)- These 
propositions therefore provide sharper and sharper estimates, so at various junctures we will use 
whichever one is easiest to work with, while still being sharp enough. 

Proof of Proposition \2J\ Using the fact that ^p{x) — e^^ is concave up, 

□ 



n \ ^ — ' n 

3 ^3 



Proofs of Propositions [KM and \2.'A To prove Proposition [^21 there are two steps: 

1. Show that a minimizer of -F'q(fc) on D{c2) cannot have a configuration Oj-^ > 9j,-^ > 9j^; that 
is, that it cannot have three distinct entries. 

2. Show that a minimizer 6 cannot have a Oj-^^ — > configuration {ji £ [n]), which will 
imply that the larger entry is unique. 



We will only prove Proposition 12.31 as it is more difficult; the interested reader would not find it 
difficult to adapt the argument to prove Proposition 12.21 

We may recast Proposition 12.31 as follows: letting Zj := qjk, we want to minimize G{z) := 



= 



k^C2, and ^z| = A:3c3. (2.5) 



Our task is to show that the minimizer x — (xi, . . . ,a;„) of G(z), with components listed in nonin- 
creasing order, must have the form xi — ■ ■ ■ = x^ > x^^i > x^+2 = ■ ■ ■ — Xn for some £ [n]. To 
this end, we show first that a minimizer of G(z) has at most four distinct components (Case I), and 
second, that an entry value which is strictly intermediate is encountered exactly once (Case II). Our 
proof does not rely on the method of Lagrange multipliers, because its applicability for the equality 
constraints needs a prior justification and because, in principle, it may deliver only a "first-order" 
necessary condition, definitely too crude to handle Case II. 

Case I. We first show that a minimizer of G(z) cannot have four distinct entries. Suppose 
for the sake of contradiction that we have a minimizing vector x for which there exist ji, j2, J3,i4 
(relabel as 1,2,3,4) such that xi > X2 > X3 > X4^ > 0. Let yj = Xj -\- Sj for j = 1, . . . ,4, and yj = Xj 
for j = 5, . . . , n; we will show that for a suitable choice of (ei, . . . , £4), y = (yi, . . . , j/„) satisfies the 
conditions (|2.5p . and G(y) < G(x), and thus such an x cannot be a minimizer on the set D{c2, C3). 
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First note that we require £4 > because of the possibihty that X4 — 0, but ei,£2,£3 can be of 
either sign. For y to satisfy the conditions (|2.5p . we require 



3E^ 



i=i 



i=i 

4 

E^ 

4 

E^ 



0, 



0, 



(2.6) 
(2.7) 
(2.8) 



Now we want G(x) — G'(y) > 0; by hnearizing the e factors, it wiU be sufficient (by taking the Sj 
as small as we wish) to show that 



"=^£2 + e~''^e3 + e 



-^''£4 > 0. (2.9) 

We now obtain expressions for the Sj. For given Zj, the system p.6p - (j2.8p is a system of 3 nonlinear 
equations in 4 unknowns £i,£2,£3,£4; treating £4 as a parameter, we hope to be able to solve it 
uniquely for £i,£2,£3 near (0,0,0)-^. Let e := (£1, £2, £3)"^, and write (|2.6p - (|2.8p as the vector 
equation 

f (e) = b(£4), b(£4) := (—£4, — 2x4£4 — £4, ~3x^£4 — 3x4£4 — el)'^ . 
The derivative (Jacobian) matrix of f at is 

/111 

L := 2xi 2x2 "2x3 

Its determinant is equal to 6A(a:i, a;2, 0:3), where A(a;i, 0:2, a^a) is the Vandermonde determinant for 

Xi,X2,X3, 

A(a;i,a;2,X3) = {x2 - xi){x3 - xi)(x3 - X2), 

which is non-zero (negative), as the Xi are distinct (decreasing). Therefore, by the Inverse Vector 
Function Theorem (IVFT), for |£4| sufficiently small there exists a differentiable solution e = £(£4), 
£(0) = 0, such that 

£ = 7^4 + 0(£|), 7 := L-^h'{0) = L-\~l, -2x4, Sxlf . 

Explicitly, by Cramer's rule, 

A(a;i,a;4,a;3) A(a;i, 2:2, ^4) 



A(x4,a;2,a;3) 
7i = -T7 72 



73 



A(a;i,X2,a;3) A(a;i, a;2, X3) A(xi,X2,a;3) 

With these formulas, ()2.9|) is equivalent to showing (by letting £4 > be as small as needed) 

e"^*A(xi,a:2,a;3) < e^^^ A(a;4, 0:2, 0:3) + e^'''' A{xi, x^, X3) + e^''" A{xi, X2, X4), 
which in turn is equivalent to showing that 











1 


1 


1 


1 


Xi 


X2 


2:3 


X4 










xl 


X2 


4 


X4 



> 0. 



By using the operations typical for computation of the Vandermonde-type determinants, we get 

-X2 _ p—Xi p-X3 _ „—Xi f,—Xi _ „—Xi 



D(X) 



e -"^ - e 

X2 - Xl 

2 2 

-^2 ~ •'^1 



X3 - Xl 
2 2 

Xo Xl 



X4 — Xl 



Ylixt-xi) X 

i=2 



\{xi - X2) - A(xi - X3) A(xi - X2) - X{xi - Xi) 

X3 - X2 X4~ X2 
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where A(a;) :— (e^ — Next, we factor X3 — X2 and — X2 from the first column and from the 

second colmTin. So, mtroducing 

4 4 



j=2 



i=3 



we then get 



Z?(x) - C(x) 
-C(x) 



A(zi— 3:2) — A(zi— 3:3) \{xi—X2) — \(xi — Xi) 
X3 — X2 X4—X2 
1 1 

X{xi — X2) — A(xi — X3) \[xi — X2) — \{xi — Xi) 



{xi - X2) ~ {xi - X4) 



(2.10) 



{xi - a;2) - [xi - xa) 
Now A(x) is concave up for x > 0. Therefore, since 

< xi — 2:2 < xi — ^3 < — a;4, 

the quantity in square brackets in (|2.10[) is strictly negative, by considering its terms to be slopes 
of secant lines to the graph of A(a;). Using this and C(x) < 0, we get the desired conclusion, i.e. 
D{ji.) > 0. 

Case II. Now we show that a vector x with a configuration xi > X2 — X3 > X4: > cannot be 
a minimizer of G either. Define x := X2 = X3. Now that f'(0) is singular, determination of small 
feasible ei, . . . ,£4 such that G(y) < G(x) is more of a challenge. The fact that the linear terms in 
(|2.6p - (|2.8p now depend on only ei,e2 + £3, £4 hints that |ei|, \e2 + £3], |£4| should be equally small, 
and that \e2\ and jeaj, while small, should be much larger. 

Believing in this scenario, we set 



ei = Sie , £2 = £ + <52£ 



£3 



£4 — 648 



and seek the feasible Si{e) for small e. To begin with, we again require S4 > 0. The conditions 
d^lll-llMl) become 



S1+S2 + S4 =0, 
XiSi + XS2 + X4S4 = — 1 + £^^2(£, S), 
xlSi + x'^S2 + ^4^4 = -2x + ebsie, d), 



(2.11) 
(2.12) 
(2.13) 



where the bi{e,S) are polynomials. Notice that A :— A{xi, x, X4), the determinant of the matrix 
in (|2.1ip - (|2.13p . is nonzero. So for |£| small enough, there exists a differentiable solution 6(e), such 
that (5(0) is the solution of (|2.1ip - (l2.13p with 0, —1, and —2x respectively on the right hand side. 
By Cramer's rule. 





-1 



1 1 



X X4 



-2x x^ 



Xl 



1 

— 1 X4 
~2x X ^ 



which gives 



5m = 



{x — X4)'^ 



(^2(0) 



(a:i — X4){xi + X4 ~ 2x) 



64(0) = - 



1 


1 





Xl 


x 


-1 


xl 


x^ 


-2x 


{x- 







Reassuringly, ^4(0) is positive (because A is negative). Again we want G(x) — G(x + e) > 0; we have 



4 



G(x)-G(x + £)=}_^(1 
i=i 

=e-"n'5i£'+0(£^)) 



+ l-e- 



'(1 



') 



+ e-^{e + 528^ - (l/2)(£ + 52e^f + 0{e'') - e - (l/2)£2 + 0{e'')) 
+ e-"H(54£' + 0(£^)) 
=£2(5ie-"i + (^2 - l)e-" + 54e-^^ + 0{e)). 



(2.14) 
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Thus, by taking e sufficiently small, (|2.14p will be > if 

<5i(0)e-"i + (52(0) - l)e-" + 6-^^4(0) > 0. 
In light of the formulas for the Si(Q) and the fact that A < 0, this is equivalent to 

r(x) := -{x - x^fe-'"'^ + (A - {xi - Xi){xi + 2:4 - 2x))e-^ + {x - xife-""^ > 0. (2.15) 
Now multiplying r(x) by and using the inequalities 

<- 1 _ (2;^ _ 3.) ^_ (2;^ _ 2.)2/2, e="-^* >1 + {X-X4) + {X- X4f/2, 

we get 

e^T(x) = {x- X4)2(l - + {x - x^f{e''-'"' - 1) + A 

>{x-xa) l(a;i-a;) ^ \ + [x - xi) I (a; - X4) H ^ / 

= (a; — X4)^(xi — x) + (x — Xi)^(a: — X4) + {x — Xi)(x4 — a;i)(a;4 — x) 
= 0. (!) 

Therefore (I2.15P holds, and thus as before, x cannot be a minimizcr, and this concludes Case II. 
This only leaves the possibility that the minimizcr x of G is of the form 

Xi — X2 ^ ■ ■ ■ — Xj^ > Xjy^i > Xi,^2 — ■ ■ ■ — 

and thus that the niinimizer r of Fq{k) over D{c2, C3) is of the form p.4p . for any k. This concludes 
the proof of Proposition [2?3l □ 

2.2 Identifying and iterating a likely recurrence inequality 

Let 

fc* := c^^lar^n, (2.16) 

with £ coming from \1A\ . (fc* is meant to be an integer, as is another parameter ki defined later, but 
for simplicity we omit the "integer part" notation.) This A:* will serve as a threshold separating the 
"early" states {B{t) > fc*) from the "late" states {B{t) < fc*). So, in light of the informal discussion 
in the introduction, "/c* — 0(03"^)" should be more or less expected; the need for an additional factor, 
In^^ n, will become clear later, in Section 3. 

Our immediate task is to identify a function 4'p(fc), such that, intuitively at least, the random 
sequence {B{t)} whp satisfies a recurrence inequality 

S(i + 1) < *p(S(i)), if B{t)>k,. (2.17) 

Then, for k > k^,, 'l'p(fc) needs to be large enough so that, conditionally on {B{t) = k}, the event 
{B{t + 1) < *p(-B(t))} is very likely. Since we think that ^p{k) defined in (P?T|) is a "sharp" 
conditional predictor of B{t + 1), we must have '^p{k) > ^p{k). Also, to be of any use, '^p{k) must 
fall below k. Last, but not least, we must be able to solve a chosen recurrence. The function 

*p(fc) = + %{k)) = 1 (fc + n - Fp(fc)) , (2.18) 

with Fp{k) as defined in (|2.2p . happens to meet all three requirements. 
Define an event 

A {Vt : B{t) > fc* =^ B{t + 1) < *p(B(i))}. 

In Section 2.5 we will show that P(A) ^ 1 as n ^ 00. In this section, assuming that the event A 
holds, we solve the recurrence (|2.17p and estimate sharply T(k^.), the first moment t when B{t) < /c*. 
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Lemma 2.4. On the event A, 

T{k^,) < Inn = 0(02""'^). 

Proof. The proof is divided into two cases. 

Case I. C2 > 2n^^. First of all, by Proposition [^21 we have 

*p(fc) <*e(c.)(fc) Vfce[n], 

6(02) being the topheavy distribution 62, ■ ■ ■ , O2) with parameter C2. Therefore on the event A, 

B{t + l)<-^e{c,){B{t)), yt<T{K). (2.19) 

Let us bound 5'e(c2) from above. Since C2 > 2n~^, we get, using (|2.3p . 

.1/2 

n — 1 

By (11201) and 



>icV2 ^ e,<l^Sm2^, (2.20) 



vl/e(i?(t)) - i(S(t) +n- e-«i^(*) - (n - l)e-''^^W), 



we have (using the inequality e ^ > 1 — a;) 



B{t + 1) < ^ (S(^) + n - - (n - 1)(1 - e2B{t))) 

<\(B{t) + {l-cy'/2)B{t) + l) (2.21) 

< |^l-^^B(t) + i, (2.22) 

a linear recurrence inequality. (Implicit in this derivation is an intuition that, for the distribution 
6 in question, a large enough proportion of collisions happen in box 1, and that we may disregard 
collisions in boxes 2, . . . , n without inducing too large an error.) It follows from (|2.22p and -B(O) = n 
that 

/ 1/2 \* 

-B(t)<n 1-^1 +26^^'^ ioi- B{t)>K. (2.23) 



To get a bound on t(A;*), notice that 



l/2\ ^C^.)-! 
^1 ^0.-1/2 



< B{T{h) ~ 1) < n - ^ j +20^''' (2.24) 

and let t := T{k^) — 1. Now C2 = o(/c*) if C2 — o(ln^'^^ n), which is certainly implied by (|1.4p . So 
we can crudely use the bound 2c2 < (l/2)/c* in (|2.24p to get 

(l/2)/fc, < n(l - c}J^/AY < nexp (-C2^V/4) . 

Taking logarithms and solving for t, we obtain 

T < 4 (In n + In C2 + e In In n + In 2) C2 """^^ < Scj "'"^^ In n. 

Hence T(k^) = o{c2^), since C2 = o(ln^^ n), which is the first condition in p.4p . 

Case II. C2 < 2?!^^. This time 9{c2) is too close to being uniform, and the inequality (|2.2ip is 
too crude. A bit of reflection shows that we should not expect that B{t) decay exponentially here. 
We show instead that, for some absolute constant A, 

Bit)<^, t<T{k,). (2.25) 
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The proof is by induction. The case t = holds if A> 1. Suppose (|2.25p holds for some t. Observe 
that 

n ^ 

and that ipi^) is increasing and concave down. Then by the inductive assumption, 
So we need to find A > 1 such that 



^[j^] <A/it + 2), (2.26) 



or, defining a; :— A/{t+ 1), 



^(.)<-^ ^l-e-^<.(l-^). (2.27) 

We therefore define x{t) imphcitly by 

l_e-(*)=x(i)(^l-^); (2.28) 

by considering the graphs of the functions of x{t) on the left- and right-hand sides of l|2.28p . it is 
clear that x{t) is well-defined and decreasing (to 0) in t. Therefore (|2.27p is satisfied iff x > x{t). It 
is not difficult to show that 

4 

and so A^, := limsup(^oo(i -I- l)x{t) is finite. Thus to satisfy (|2.26p and thereby to complete the 
inductive proof, wc can pick A — max{l,v4»}. 
Therefore, on the event A, we have 

K < b{t{K) - 1) < 

which we can invert to get 

Ati 

TikA < — = 2Aln^n, 

which is certainly o(c2 ^^"^ In n) . Combining this with the case C2 > 2/n, we have T(fc, ) — 0(c2 In n) 



on the event A. This completes the proof of Lemma vlM □ 



2.3 Exponential tail bounds 

We need to show that -P(A) converges to 1, and that it does so sufficiently fast. 

To this end, and also for Theorem 11.21 we establish two-sided exponential tail bounds for for 
the distribution of Bit + 1) conditioned on B{t). Like Chernoff bounds for sums of i.i.d. random 
variables, the bounds are based on a generating function approach for estimating the probabilities 
of large deviations. 

Let TTfeb :— P{B{t + 1) = b\B{t) = k), and note that TTkb = for k < b. Introduce the tail 
probabilities 

7r-^ = P{B{t + l) <b\B{t) = k) and 7r+ = P(B(i + 1) > 6|B(t) = fc). 
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Theorem 2.5. 



T^kb — 3%/fcexp 



T^kb - 3\/fcexp 



(^p(fc) - b)' 
2k 



2k 



b < %ik) 



6>$p(fc). 



(2.29) 



(2.30) 



Proof of Theorem \2.5\ The heart of the proof is an expression for n^b by means of generating func- 
tions. As usual, the expression [?/'"]/(?/) denotes the coefficient of in the power series expansion 

of/(y). 



Lemma 2.6. 



Proof. 



nkb = ^[x'z']\\{{l + z{e-^^^~l)) 



Kb < k < n. 



ntb^ P{B{t + l) = b\B{t) = k) 

= P[k baUs go into exactly the boxes indexed by U) 

(7C[n], \U\=b 

= P(each box j from U gets £j balls) 



U eiH h£6=fe 

ej>0 Vje[h] 



U e 



£l£2 ■ ■ ■ £b 



J6C/ 



^^EEn^' 

1/ r jey ^' 



Now we build a bivariate generating function for the probabilities iTkb- Start with the k index. 
Incorporating for future convenience an n'^ factor, we have 



E 



n 



EE E n 



TTfeb— a; = 

k ' U k eiH heb=kjeU 

£i>0 



We merge the second and third sums, yielding for the right-hand side 



EEn 



U ej>0 jeU 

Reversing the order of summation and multiplication, we get 

k 



rl 



U j€U E=l 

=E Yli'^^pi^pj^) - 1) 
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Multiplying by z and summing for & > 0, wc obtain 



h k ' b \U\=bjeU 

2'' ]^(exp(7ipja;) - 1) 

uc[n] jeu 

= X! n 2:(exp(npjx) - 1) 

UC[n]jeU 
n 

= Y\.[^ + z(exp(npjx) — 1)] . 



Therefore we have 



and from here the lemma follows. □ 



By Lemma 



fc! 



gk{z) := = k] = Y,^kbz' = ^[x'] + z{e^P^^ - 1)). 

Now, for b < k, we have 

b 6 

J2^ktz^ < ^TTktz" < gk{z), 0<Z<1, 



Z TT 



Z^ 



kb 

i=l i=l 
k k 

kb ^ Y '^ktz'' < ^ TTfejz' < gk{z), z>l. 

i=b+l i=b+l 



This gives 

n^,<^^ -T^ix"] + ^(e"^^" - 1)), for < z < 1, (2.31) 

- ^ = ;^[-'] + -(^"^^^ - 1))' fo'^ - ^ 1- (2-32) 

Since the coefficients of the products in (|2.3ip - (|2.32p are nonnegative, we use the inequality \x^\f{x) < 
f{x)/x'' (V a; > 0) to obtain 

-kb<^bT^Ei^ + ^(e^''^''-^)), Vr>0, < z < 1, (2.33) 

^,\<p;7^n(l + ^(^"'^^-l))' ^'^>0' -^1- (2-34) 

Our task is to get the most out of these bounds (|2.33p - (|2.34p by choosing values for z and r judiciously. 
We use Stirling's formula fc! < 3\/fc(fc/e)'^ to transform the product-type formulas (|2.33p and (|2.34|) 
into 

TTfef, < 3%/fcexp(iJ(z,r,6)) , z < 1, (2.35) 
T^tb < sVfc exp(i? (z, r,b)) , z > 1, (2.36) 
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where 

H{z,r,b) :=fcln( — ) - 61n(z) + V ln(l + z(e"P^'' - 1)). (2.37) 
V rne I ^-^ 

^ ' j 

For a given 6, we want to use a stationary point of H{z, r, b), i.e. a solution to 

H,^~- + Y ^=0, (2.38) 

j 

i^.--^ + ^E i /T:J (2.39) 

r 1 + z(e"-Pi^ — 1 

j 

This complicated system has a simple solution (z^,r^) = (1,/c/n) for 

j 

Moreover, from (|2.37p it is immediate that H{z^, r*, 6*) = 0. This is a first sign that the inequalities 
(|2.35|) - (|2.36|) may indeed lead to meaningful explicit bounds for tt^^. Of course, we need to know 
that (|2.38p - (|2.39p has a solution (z, r) for 6 7^ 6, as well, such that z < 1 for 6 < 6*, and z > 1 for 
b> K. 

Observe that 

det 2;; hZ ) = - > 

for every solution (z, r) of (|2.38p - (|2.39p (see Lemma [A. II in the appendix for a proof of this). So, by 
the implicit vector function theorem, there exists an infinitely differentiable solution (z(6),r(6)) of 
([2:38l) - ([2:39)) . such that 

(z(6*),r(6*)) = (z*,r*). 

Moreover, z(6) is strictly increasing (see (jA.13|) in the appendix), so that indeed z(6) < 1 for 6 < 6*, 
and z(b) > 1 for 6 > 6*. So, introducing 

hib) = H{z(b),r{b),b), (2.40) 

we have 

TT^Tf, <3Vfcexp(/i(6)), 6<6,, (2.41) 
TT^f, <3\/fcexp(;i(6)), b > b,; (2.42) 

here /i(6,) = H{z^,,r^,,b^.) — 0. 

To get efficient bounds from (|2.4ip - (|2.42p . let us approximate h{b) by its Taylor polynomial 
about 6,. First, using ((238)) - (IOQ]) . 

/.'(fe) = ^i/(z(6),r(6),6) 

= i7,(z(6), r(6), 6)z'(6) + i7,(z(&), r(6), 6)r'(6) + Hb{z{b), r{b), 6) 

= Ht(z{b), r{b), 6) = - ln(z(6)). (2.43) 

It follows that h{b) is unimodal (concave down, in fact), attaining its zero maximum at 6 = 6*. 
Consequently 

h"(h) h"(h) 
h{b) = h{b^) + h'{b^){b - b,) + (6 - 6,)2 = (6 - b,f, (2.44) 

6 being between b and 6*. It is shown in the appendix (Lemma IA.2P that 

h"ib) < -i. 

This bound and jMl]), (12321), ^i2A^ imply ([2:291) and ([SjQl), thereby concluding the proof of 
Theorem ED □ 
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2.4 Using the exponential tail bounds 

For the upper bound, we will need (|2.30p . which gives, for b = ^'p(fc) (recall the definition (|2.18p of 
*p(fc)), 



= S'v/fcexp 



(*p(fc)-<i>p(fc))^ 



2k 



8k 



(2.45) 
(2.46) 



Introducing 

we rewrite (I2.45|) as 



Hpik) :--(*p(fc)-$p(fc))^ 



^U^k) - 3^exp (-^Hp{k) 



(2.47) 



The next Lemma states, roughly, that the larger k is, the more likely it is that the next state B{t+1) 
is close to the prediction based on information B{t) — k. 



Lemma 2.7. For all k and p, Hp{k) is increasing in k 
Proof. Define 

iVp(fc) := *p(fc) - $p(fc) 



- I fc-n + Ve-P^M 



so that Hp{k) = k-'^Np{kf. Then 

K{k)^^^[2kN^{k)~Np{k)], 



where 



because 



1 " 

2kN'p{k) - Np{k) ^-Y^ {p,k + 1 - i2p,k + l)e-P''^) > 0, 



fix) -.^x + l- {2x + l)e~=" >x + l 
This completes the proof of Lemma 12.71 
Lemma 2.8. For all k E {fc*, . . . , n}, 



2x + l 



x+l x+1 



> for X > 0. 



□ 



Hp{k) > lni+^ n, 

where is some absolute constant. Thus the probability in (|2.47p is superpolynomially small. 

Proof. Consider first the case C2 > 2n^^. From Proposition 12 . 31 and Lemma [2?7l it follows that, for 
ah k > k^, Hp{k) > Hp{k^) > H,,{k^); here 

Hr{k,) = (*r(fc*) - 4'r(fc*))Vfc* 

71 



1 

ifcl 



> 



4fc, 



(2.48) 
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To bound (|2.48p from below we need to have sharp bounds for ri and i^. RecalUng the definition of 
r in Lemma |2.3[ and letting fi := n — v — I, we have 

vri +r2+ Atrs = 1, vrf + rj + /irg = C2, vrf + + /xrf = C3. (2.49) 

Obviously r^, < n^^. Since we assume that C2 > 2n~^, we also have 

2 4 

C3 > C2 > -IT. 

Hence 

'''^^ ^ n2 - n - 2 ' ^"^3 < ^3 =^ ^2 ^ 4 • 1^-50) 
Combining (|2.49p . (|2.50p and r2 < ri we get 

? < J^r^ < C2, ^ < J^r^ < C3. (2.51) 
4 8 



These double inequalities imply directly that 



3 C3 , C3 1 Co 64 Co ,„ 

o— <?'i<4— , :rr4<J^<^-|- 2.52) 
S C2 ~ C2 64 c§ - - 9 c§ ^ ' 



Armed with (|2.52p we return to (|2.48p . Recalling that fc, = C2^^1n~^n, we need to consider 
separately the subsequences {n,} such that rifc* ~ 0(1) for n G {ni}, and the subsequences {n-i} 
along which rifc* ^ cxd. In the first case 

mf 5—5 > d > 0. 

So, using (P^Tjl . we obtain from (P^ : 

i/r(fc*) > ^^V^fcJ > ^(.r?)2c2-^ln-^^n > ^^^c^' In''' n. 
In the second case, for rii large enough. 

So, using (P3T|) again and (P3^ . 

16 16 2^^ C3 

Therefore there exists a constant A> such that, for n large enough, 

i?r(fc*) > Amin [c^^ In"^^ n, ^ ln~^ n 



for if there weren't, there would be a subsequence {''ifc*(fT-j)}j>i along which this did not hold. This 
subsequence would then have a further subsequence which tended to infinity, or remained bounded, 
which would contradict the bound for one of the two cases established above. Since 

^ 3/2, -(l/2+£) ^ r<^ -2 

C3 < C2 m ^ ' ' n, C2 < C m n 
for some large enough constant C, the last inequality leads to 
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for n large enough, as long as e < 1/4, which of course we may assume without loss of generality. 
Consider now the case C2 < 2n^^ . This time, by Proposition 12.11 and Lemma |2.7[ for all 

Hp(fc) > ^^p(fc*) > H^{K), 



fc > fc, = C2 ^ In ^ n 



where 

H^iK) = ^ (fc. -n + ne-W.)^ > 3_ > ^nln^n. 
(For the first inequality we used 

e---l + a;> y-^> y, for (0,1).) 
Thus Hp{k) = LJ (ln^+'' n) for this case, and this concludes the proof of Lemma [2?8l □ 
2.5 Validation of the deterministic approximation 

Now that we have established the (superpolynomially) small bound for tt^^ ^^-^ , we can finally show 
that the event A is extremely likely indeed. 

Lemma 2.9. For some constant A> Q, 

P(A) > 1 - 12nlnncxp(-ylln^+^n) > 1-?^"^', Vif > 0, n>n{K). 

Remark. Borrowing a term from Knuth et al. [10], the event A holds quite surely (q.s.). 

Proof. Introduce the events 



Ct = |B(s + 1) < *p(S(s)) V s < t, B{t + 1)> «'p(S(i)), B{t) > fc, 

that is, Ct is the event that the recursive inequality B{s + 1) < ^'p(i3(s)) is violated at a state 
B{t) > fc*, and t is the first such moment. Clearly 



A^ = y Ct. 

Let us show that 

Ct = 0, Vi > := 4c2"^/^lnn. 
Suppose that on the contrary Ct^ 7^ for some ti > Then, by the definition of Ctj^, we have 

B(s + 1) < *p(B(s)), Vs<i*, 

and certainly B{t^,) > fc*. However, using this recurrence inequality exactly as in the derivation of 
(|2.23p . we must have 



(4c-/^) (Inn) (cf/4) 



B^ , 

< n exp 



— 1 ^ ^ ^* 



since C2 — o (in ^'^n). Contradiction! Thus A^ is a union of at most events Ct. Now by p.45p 
and Lemma 12.71 we have 

P{Ct) < 3ni/2exp(-i?p(fc,)/2), t > 0, 
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since B{t) > on Ct- Therefore 

PiA^) = p(\JCt)<Y.PiCt) 



< 3t,n^/^exp(-iJp(fc,)/2) 
= 12c~^/^i^/2lnnexp( - Hp{K)/2) 

< I2n In n exp ( - i7p (fc* )/2) 

< 12nlnnexp ( — 0.5A„ \n^^^ nj by Lemma [ 

and from here the lemma follows. □ 



This completes a program we put forth at the end of the introduction. Combining Lemma 12.41 
and this last Lemma l2.91 we have proved the following. 



Lemma 2.10. Let fc^ = C2 ^ In ^ n for some e G (0,1/4) however small, and let T{k^,) denote the 
random moment when B{t) falls to or below fc» for the first time. Then, for some constant a > 0, 

P{T{k^) <ac^^^^\nn} >l-n-''^, Vif > 0, n>n{K). 

In short, q.s. r(/c*) — o{c2^)- 

2.6 Bounding the expectation of r(k^) 

Even though T(fc*) = o(c^^) q.s., proving that i?[r(fc*)] = o{c2^) as well is not straightforward, since 
we do not have a polynomial (worst-case) bound for r(A:*). As a first step, introducing the event 
indicators /a, Ia-^i we split E[T{k^,)] using 1 = /a + I A" and bound the second summand via the 
Cauchy-Schwarz inequality: 

E[T{k,)] = E[T{k,)lA] + E[T{k^)lA^] 

< 5(lnn)c-^/' + ^E[THk.)]-E[ll.] 

< 5(lnn)c^'/' + ^E[T^k,)] ■ /P(A^. (2.53) 

It remains to show that i?[T^(fc*)] is at most polynomially large. To do so, introduce Tk, the random 
time the process {B{t)} spends at state k, i.e. 

Tk = \{t>0 : B{t) = k}\. (2.54) 

Then 



/ n \ 2 „ 

\k=k,+l ) /c=fc.+l 



again by the Cauchy-Schwarz inequality. Therefore 

n n 
A;=fc,+1 fc=fc,+l 

Recalling the notation TTfefe = P(B(t + 1) = fc | B(t) = fc), we observe that 

P{n - i I Tfe > 0} = 7r^,^^(l - ^fefe), 3 > 0, 

i.e. conditioned on {Tk > 0}, Tk is geometrically distributed, with success probability 1 — iTkk- In 
particular, 

E[Tk I Tfc > 0] = , Var[Tfc | Tk > 0] = 
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It is obvious intuitively, and can be easily proved, that nkk decreases with k: the larger the number of 
balls — the larger the probability of collision. Consequently both conditional moments of Tk decrease 
with k. So 



E[T^{k,)] <n 



T^kk 



k=k,+i (l-7rfefc)2 

k=k. + l ^ 

< TT^^^- (2-55) 

Therefore, it remains to show only that ^ is at most polynomially large in n. Using the 

simplest lower bound for the probability of the union of events, via the inclusion-exclusion formula, 
we write 

1 — TTfcfc = P(there is a collision during a fc-allocation) 

> P(balls a and b collide) — P(c, d collide and e, / collide) 

{a,fc}c[fc] {c,d}#{e,/}c[fc] 

k 



„ , C2 1 - flifc a2k C2 . (2-57) 

2/ V C2 J 

where ai, a2 are some absolute constants. The two rightmost terms of ()2.56p are due to the fact that 
there are two ways in which two distinct pairs of balls can collide: either the two pairs overlap at 
one ball or they are disjoint. Now introduce fci := In^^^^ n; clearly fci < A;*, and we also have 

limfci— = 0, limA:^C2 — 0. 

C2 

So, by (|2.57p . uniformly for all fc < fci, 

-^<7^(l + o(l)). (2.58) 

l-TTfefc Q)c2 

Consequently 

< = 0(fcr'c2-^) = 0(ln^/' n\ (2.59) 

which is sufficient. Thus by (|233ll and ((2?55)) . 

E[T{k^)] = 0{c^^'^ Inn) = o{c-^). (2.60) 



3 Bounding expected duration of a middle phase and a late 
phase 

To complete the proof of Theorem 11.11 it remains to bound the expected duration of the process 
after the number of balls has dropped below fc» . 

We define a middle phase as [r(A:^), r(fci)), the time interval during which the number of balls 
is below fc* and above fci. Using the Tk defined in (|2.54p . we have 

i?[r(fci)-r(fc,)]= ^t^'']^ E 

k=ki + l k=ki+l '^^ 
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Then, by decreasing monotonicity of (1 — nkk) ^ and (|2.59p . 

1 — T^ki fci 

(The last computation explains at long last why we needed the In^*^ n factor in the definition (|2.16p 
of fc,.) 

Naturally, we define a late phase as [T(fci), t(1)]. By (|2.58p . 

E[ril) - r(fci)] < 1 + E = 2^2-^1 + o{l)), (3.2) 

fc=2 ^2(2) 

where we used 

1 11,^ 

— fc > 2 

fc(fc-l) k-1 fc' 
That does it! Adding the estimates (|2.60p . (|3.ip . and (|3.2p . we obtain 

i?[T(l)] <2C2-\1+0(1)). 

4 Lower Bound 

We now provide the matching lower bound for Theorem ll.il We will not need assumptions on p as 
strong as (|1.4p : rather we will simply assume that C2(p) — > 0. 

For any m < n, let T{m) denote the coalescence time for the process starting with m balls (we 
have so far been considering the case T :— T{n).) We start by stating Theorem 2 from [1]: 

For any p, i^[T(TO)] > 20^ ^ f 1 - 1 - ^^^^ ' ^) . (4.1) 

V m 12 C2J 

To get a bound on T{n), first note the obvious- looking fact that 

Proposition 4.1. Let mi < TO2 < n. Then for any p, T{m2) stochastically dominates T{mi). 

Proof. This is a result of the following basic coupling argument: start with m2 balls, mi of which 
are marked. Then perform the usual allocation process; the time X at which all balls coalesce is 
distributed as T(m2), and at this time certainly all marked balls have coalesced as well; call the 
time that these marked balls have first coalesced Y, so that X >Y and Y is distributed as T(mi). 
From here the result follows. □ 

This result shows that for the bound E[T{n)] > 2c2"^(l — o(l)), it is sufficient to show E[T{m)] > 
2c2"\l-o(l)) for some m < n. By the inequalities 

2 ^ ^3/2 ^ / "^3 ^ 1/2 

C2 < C3 < C2 =4> C2 < — < C2 , 

C2 

we have that C2 ^ iff C3/C2 0. Let m, := (02/03)^^'^, say; then — * cxi since we assume that 
C2 ^ 0, and so (|4.ip becomes 

E[T{m^)] > 2c2^ (1 - — - 0{mlm-^) \ = 2c^^{l - o(l)), 
\ m^ J 

and the result follows by Proposition [ITT] The proof Theorem II. II is complete. 
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5 Proof of Theorem 11.21 



We start by restating Theorem 11.21 in a more detailed manner. 

Theorem 5.1. Let bo — n. Suppose C2 = (In^^n)A(n), where \{n) 
p = 6(02), i.e. p is the topheavy distribution {61, 62, ... , ^2) with 



00 however slowly. Set 



el + {n - l)9l ^ C2. 



Then for the process evolving according to this p, whp 
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so, in particular, E[t{1)] = uj ^) . 



Remarks. 



1. Our choice of p should be expected. Indeed, the recurrence inequality (|2.19|) signals, intuitively, 
that the coalescent process for p — 9{c2) is a good candidate for being the slowest among all 
p with J2jP'j = C2. 

2. In Theorem 5 of [IJ, Adler et al. had proved that for maxj pj bounded away from 0, and the 
remaining pj uniformly small, the expected coalescence time exceeds c^^ by a factor of Inn. 

Proof of Theorem \5.1i For simplicity we let c :— C2. For p — 9(c), the Markov chain is almost as 
simple as that for the uniform p. Indeed, given B{t), the number of balls that land in box 2, . . . ,n 
(call it B{t)) is binomially distributed with parameters B{t) and success probability 1 — 9i, i. e. 
B{t) = Bm{B{t), 1 — di) in short. Conditioned on B{t), we have a uniform allocation of B{t) balls 
among n — 1 boxes 2, . . . , n. And, for B{t) sufficiently large, whp B{t) ~ (1 — 9i)B{t). So, based on 
our experience with deterministic approximations earlier in the paper, we should expect that — after 
fusing balls that landed in the same box — these B{t) balls give birth to about 



balls for next generation. All the balls that landed in box 1, if there are any, will coalesce into 
one ball. Ignoring this box for now, we expect then that the process {B{t)} whp "closely" obeys a 
recurrence inequality of the form 





ip{x) :— 1 — e 



Here is a precise claim. 

Lemma 5.2. Let 7 := 1 — 2c^/^ < 1 — 6*1 and introduce 




Then, for n .sufficiently large, 



P{B{t + l)<^{B{t))\B{t) 



(5.1) 



Remark. 7/(0) — 0, ri{x) < (p{x), and ij{x) is increasing for x < ln3. 
Proof of Lemma \5.2\ Notice first that 



B{t + i) > 7^(^ + 1) 
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where TZ{t + 1) is the number of boxes among 2, . . . , n that host at least one of B{t) baUs. Denoting 
P{{-}\B{t) = k) by Pfc({-}), we have then 

Pk{B{t + 1) < *(fc)) <Pk{n{t + 1) < *(fc)) 

<Pk{n{t + 1) < > 7fc) + Pk{B{t) < jk). 

Now, denoting the c.d.f. of TZ{t + 1) conditioned on {B{t) — j} by Fj, we have: for ji < j2, 

F,Jx) < F,,ix), Vx>0. 

(InformaUy, the fewer baUs we aUocate among the boxes 2, . . . , n, the fewer nonempty boxes we end 
up with.) Therefore 

Pk{n{t + 1) < vE'(fc), B{t) > jk) = J2 ^(^(^ + 1) < *(^) I ^ ^''•(^W ^ 

j>7fe 

<p(7^(^ + 1) < vi/(fc) I = \^k^ ) ^ = j) 

<P{TZ{t + 1) < *(fc) I B{t) = [7fc] ). 

Consequently 

Pk{B{t + l) <^ik))<Vi+V2, 

Vi :=Ffc(7^(^ + l) < = [7^:1), 

V2 -.^PkiBit) <-fk). 

Using the Chernoff bound for the tail of binomial distribution (see Mitzenmacher and Upfal [IT] , for 
instance), we have 



V2 =P(Bin(fc,l-6li) <-fk) 
kpS'^' 



< exp 

<g-fcc/3 



p=l-9-i, 5=l-7(l-ei)-i 



(5.2) 



1 /2 ______ 

as 9i ~ C2 , from the definition (|2.3p of 61, 92- 

Turn to T'l. Applying Theorem 12.51 to the boxes set {2, ... ,71} and the uniform distribution u 
on this set, we obtain 

(^u(7fe)-^(fc))' ' 
27fc 



Pi < 3\/ 7A; exp 
Here, using the definition of ri{x), 

^uilk) - '^{k) =(n - l)ip 
(n-l) 



jk 
n — 1 
jk 



n- 1 



(n — l)ri 

7A: 
n-1 



7/0 
n — 1 



So, as 



we have 



x"^ x'^ x'^ 



X - ^{x) = e " - 1+ X > y - y > y , X G [0, 1], 



Pi < 4\/7fcexp 



(7A:)^ 



72-fk{n- 1)2^ 

The estimates (|5.2p - (|5.3p imply (|5.ip . thereby completing the proof of the lemma. 



(5.3) 
□ 
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To continue, let k :— r?l^ . Introduce two events, 

r {Vt, B{t) >k=^ B{t + 1) > 

and 

n := {3t > : Bit) > k}. 

On the event F, B{t) does not decrease "too quickly" as long as B{t) is above k. On the event 11, 
T(fc) > c~^/^, i. e. T(fc) = uj (c~^)- Then, by Lemma [??^ 

p(r" n n") <p } [j {B{t + 1) < ^'(b(^)), B{t) > fc} 

<(1 + C-3/2) (^e-W3 + ^-P/73n^^ 



i. e. p(run) ^ 1. 

If we show that T{k) — uj (c^^) on the event F as well, we will be able then to claim that whp 
T(fc) — Lo (c~^), and the proof of Theorem 15. II will be complete. 
To do so, we observe that on the event F, 



as long as 



x{t + l)>iiMt)), x{t):=^, x{0) = 1 - (n - ir\ (5.4) 

n — 1 



x{t) > ^ n-'^\ 

n — 1 



Lemma 5.3. Let n > 3. Under the recurrence (j5.4p . 



2 T* 

2;(i)>--^. (5.5) 

Proof. The argument runs in parallel to that for the lower bound of B{t) in (|2.25p . The base case 
t = is just 

a:(0) = 1 ~ n-i > I 

Suppose (|5.5p holds for some t. Since ri{x) is increasing for a; < In 3, (|5.4p implies that 

/ 2 7* \ /2 7*+i 



3t+ 1/ ' \3t + l 
So, to complete the inductive step, we need to show that 

t + I 2 7*+-^ 

Define z as a root of 

or l-e-^ = z^ii^. (5.7) 

' t + 2 t + 2 ^ ' 

Since 1 — e^^ < 2/3, equation (|5.7p has a (unique) root z = 2:(t) for t>0. Using 1 — e^^ > z — z^/2, 
we obtain 

4/3 
z{t) > ^— . 
^ ^ - < + 2 

Inequality (|5.6p holds if y < z(t), which is certainly so because 

yit)<^<^, vt>o, 

and from here the lemma follows. □ 
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Thus on the event T, 



B{t)>k^Bit + l)>in-l)^^. (5.8) 



Lemma 5.4. On the event T, 



20y^Mnj 20 

Proof of Lemma \5.4\ Let the event T hold. By (|5.8p . and the definition of t(-), f := t(A:) satisfies 

2 

fc > B(f) > (n-1) — . 

Recalling that k = t?!^ , 7 = 1 — 2c^/^, and taking logarithms, we get 

3ci/2^ + ln(f + 1) > ilnn, 
5 



3^V^^ + ln(T + l) > ^Inn. (5.9) 
mn 5 



It then follows that for large enough n, 



. ^ 1 In n _ _i VAH 
^ - 4 • 5 20 ^ 

as if it did not, then ln('r + 1) = o(lnn) and 



o VAR . ^ 3 

3—; r < - — - In n < In n, 

In n ~ 4 • 5 

which together contradict (|5.9p for n large enough. This completes the proof of Lemma 15.41 □ 
In summary, on the event F U 11, 



.(fc)>mm c-^c-VSUc-^. 



Recalling that P(r U IT) — > 1, we conclude that 



cE{t{\)\ > cE[T{k)] > E[T{k) /run] > P{T U H) ^ oo. 



This concludes the proof of Theorem 15.11 □ 

6 Concluding remarks, future work 

A generalization of this problem is to allow the probability of a ball going to a certain box to depend 
on its origin and not just on its destination; that is, if we have a ball in box i, it has probability 
Pij of landing in box j for any j S [n] , and these probabilities are not necessarily the same for all 
i. This is more difficult, as {B{t)}t>o is no longer a Markov chain: we have to keep track of the 
locations of the balls at any time and not simply their number. 

Coupling from the past algorithms involve running simultaneous coalescing flows on a Markov 
chain P :— {pij)i<i,j<n with stationary distribution tt, and return samples distributed exactly ac- 
cording to TT, at the time when all the flows coalesce. It would be very interesting to extend the 
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techniques in this paper to a more general case (when the rows of P are not necessarily all equal 
to some vector p) in order to obtain an upperhonnd for the expected running time of such algorithms. 

Here is an approach that appears promising. We start with a fixed allotment of one ball in each 
of n boxes, and for times we run n independent allocations for each ball, where we do not fuse 
balls that land in the same box; call this a mixing phase. This is to be taken large enough so 
that the location of each ball at time is "almost" 7r-distributed. Then at time t* + 1, we allocate 
the balls into the boxes but fuse any that collide (a fusing allocation). Continue alternating between 
mixing phases and fusing allocations until total coalescence has occurred; the coalescence time for 
this process should dominate the time for the usual process where we fuse colliding balls at every 
time. Moreover, if the locations of the balls at each fusing allocation are sufficiently independent 
and close to 7r-distributed, then we may be able to bring the results in this paper to bear on this 
more general case. 



A Missing parts of the proof of Theorem 12.51 

Lemma A.l. Let x = Hrriizz — H^z- Then x > for all (z, r, b) on the curve C defined by p.38[) 
and ([239]) . 

Proof. By the definition of iJ, we have 



H„ = — + z(l- z)n^ V ^ , (A.2) 



and 



Hrz = ny^- ^ --r. (A.3) 

(1 + z(e"P^'- - 1))2 ^ ' 

We can recast ([^35)1 and ((05)) as 

fc-V .(e"^---l) 

^ 1 + z(e"P3'' - 1)' l + z(e"Pj''-l)' ^ '' 

Using these in (|A.l|) - (jA.3p . we get that on C, 



_ 1 ,(e»P,^ - 1) (e"P,^ _ 1)2 

z2 1 + z(e'W - 1) Y (1 + ~ 1))^ ' 



H =—Y^ +z(l-z)n^Y^ ^ 

r 1 + z(e"P^'' - 1) ^ ' ^ (l + z(e"P^'^ - 1))2 



and now (jA.5p simplifies to 



1 (e"Pir _ 

- - (l + ^(e«K-_l))2 • (^-^^ 



For we can put under a common denominator and get 

n2pje"W(l + z(e"P^'' - 1)) + rz{l - z)n^p]e'^'P^'' 



r(l + z(e"Pj'' - 1))2 



24 



Using the inequality — 1 > x gives 

^ nzpje^P^^{l + znpjr) + rz{l — z)n^p^e"P^'' 
- r(l + z(e"Pj'' - 1))2 ' 

which then leads to some very convenient cancelling (in particular, of the term) to get 

- ^ r(l + zCe^W- - 1))2 • ^^-'^ 

Multiply (|A.6|) and (|A.7|) together: 



(1 + z(e"Pj'' - 1)) 



2 



and use the Cauchy-Schwarz inequality to get 



""^l^ (1 + z(e"P^'~ - 1))2 I ■ ■ 

We need to show that this is strictly greater than H^^, which can be expressed, using (|A.3[) . as 



H^, = n'ly ^/^"''^ ,,,, I . (A.9) 



Taking square roots of the expressions (jA.8|) and (|A.9[) . the condition x > is equivalent to 

^ (1 + zieP-Pi'^ — 1))2 

For (|A.10p to hold, it suffices that each summand is nonnegative (and at least one strictly positive). 
Multiplying the numerators by \frn, we need to show that V j, 1 < J < 

[(e"W'- - \)npjr{\ + ?ipjr-)e"P^n^^^ " mpj-e^P^'^ > 0. 

This is equivalent to showing that V j, 

j(np,r) > 0, (A.lf) 
where we define f{x) = (e^ — l)a::(f + x)e^ — x'^e'^^. Now 

fix) = xe^^ + x^e^^ - xe'' - x^e^ - x^e^^ 



= xe - xe^ - X = xe^{e^ - 1 - x) > for x > 0. 

Therefore the inequalities in (|A.lip hold (and at least one of them is strict), and so the lemma 
follows. □ 

Lemma A. 2. With h{b) as defined in (I2.40p . we have, uniformly for b < k, 

h"{h) < -1. 
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Proof. First note that 



h"ib) = ^ ih'ib)) = ^ (- ln(z(fe))) by ^ 



= -z'{b)/z{b). (A.12) 

To find z'{b), differentiate Hz(z{b),r{b),b) = and Hr{z(b),r{b),b) — witli respect to b; we can 
solve for z'{b) in this system (using H^b = 0, H^b = — 1/z) to get 

Ab) = = (A.13) 

z{b){H,,Hrr " H,^J z{b)x 

which is strictly positive by (IA.7|) and Lemma [A. II Now note that, using the expression (jA.6p for 
Hzz, and (jA.4p to express b, 

Hzz _ 1 z(6)(e"W''W - 1) 

7^ ^ 



6 z(fe)26^ [l + z(6)(e«w(fc) - 1)] 



2 



- z(fe)26^ l + z(6)(e"W'^W - 1) z(6)2' ^ ' 

Therefore, using (|XT2l) . jXTH, and jXTi)) . 

where the last inequality holds because < x < H^rHzz- Using the fact that b < k, ^p{k) < k, and 
b is between b and ^p{k), we obtain 

h"ib) < -i. 

□ 
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